The Weak Representation of
Intermediate Order Statistics

by 

Vernon  Watts 

ABSTRACT 

It is shown that under certain conditions a sample intermediate order
statistic of a sequence of random variables has a representation in terms of
the empirical distribution function and a remainder which is stochastically
of small order. The methods employed yield readily to the general setting
of sequences which are m-dependent and not necessarily identically distributed.
Under the assumptions it follows from the representation that the sequence of
intermediate order statistics is asymptotically normal.

1. Introduction. In [5] and [6] we showed that under certain conditions
a sample intermediate order statistic from an independent and identically
distributed sequence of random variables has an almost sure representation in
terms of the empirical distribution function and a remainder of suitably small
order. The methods we used were parallel to those of Bahadur [1], who
established the corresponding representation for sample $\lambda$-quantiles.

In this paper we show that similar representations for intermediate order
statistics can be obtained under fewer assumptions. However, the conclusions
are weaker in that the convergence of the remainder term is "in probability"
rather than almost sure, and moreover, a less precise order of the remainder
term is found. Our procedures follow those used by Ghosh [3] to extend the
sample $\lambda$-quantile representation to sequences satisfying conditions weaker
than those of Bahadur.

Without loss of convenience we achieve a degree of generality by relaxing
the requirements of independence and identical distributions. In place of the
former we use the condition of m-dependence, and instead of the latter we
allow non-identical distributions which are in a sense uniformly close.

2. Preliminary notation. Let $\{X_n\}_{n \ge 1}$ be an m-dependent sequence of
random variables on the probability space $(\Omega, \mathcal{F}, P)$, with marginal
distribution functions $F_n(x) = P(X_n \le x)$. By m-dependence we mean that
$\mathcal{F}(X_1, \ldots, X_n)$ and $\mathcal{F}(X_{n+t}, \ldots)$ are independent $\sigma$-fields if $t > m$,
where $\mathcal{F}(\cdot)$ is generated by the indicated variables. Let $\{k_n\}_{n \ge 1}$ be an
intermediate rank sequence of integers, that is, $1 \le k_n \le n$ and $k_n/n \to 0$,
and denote by $\{X_{k_n}^{(n)}\}$ the corresponding sequence of intermediate order
statistics. For each $n \ge 1$, let $x_0 \ge -\infty$ be such that
$x_0 = \inf\{x : F_n(x) > 0\}$ and $F_n(x_0) = 0$; equivalently, $x_0$ is the left
endpoint of $F_n$ and $F_n$ assigns zero probability to $x_0$, for each $n$. Now define

    $F_{(n)}(x) = n^{-1} \sum_{i=1}^{n} F_i(x)$.

Then for $n \ge 1$ and $i = 1, \ldots, n$, let $x_{n,i}$ and $\bar{x}_n$ be such that

    $F_i(x_{n,i}) = k_n/n$,

    $F_{(n)}(\bar{x}_n) = k_n/n$.

Under assumptions to be imposed, $x_{n,i}$ and $\bar{x}_n$ will be uniquely defined for all
sufficiently large $n$.
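
The following Python sketch illustrates how the centering constants might be computed in a concrete case. It is only an illustration under stated assumptions: the exponential marginals, the alternating rates, and the values of `n` and `k_n` are hypothetical choices, not taken from the paper, and $F_{(n)}$ is taken to be the average of the marginal d.f.'s. The function names are likewise hypothetical.

```python
import math

def averaged_df(x, rates):
    """F_(n)(x): average of the marginals F_i(x) = 1 - exp(-rates[i] * x)."""
    return sum(-math.expm1(-r * x) for r in rates) / len(rates)

def marginal_quantile(p, rate):
    """x_{n,i}: the solution of F_i(x) = p for an exponential marginal."""
    return -math.log1p(-p) / rate

def averaged_quantile(p, rates, iterations=200):
    """x-bar_n: the solution of F_(n)(x) = p, found by bisection."""
    # F_(n)(0) = 0 <= p, and F_(n) >= p at the quantile of the slowest marginal.
    lo, hi = 0.0, marginal_quantile(p, min(rates))
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if averaged_df(mid, rates) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

n, k_n = 1000, 30                                   # hypothetical sample size and rank
rates = [1.0 + 0.5 * (i % 2) for i in range(n)]     # rates alternate between 1.0 and 1.5
x_bar = averaged_quantile(k_n / n, rates)
x_ni = [marginal_quantile(k_n / n, r) for r in rates]
print(x_bar, min(x_ni), max(x_ni))
```

In this example the left endpoint is $x_0 = 0$, and the averaged quantile lies between the smallest and largest of the marginal quantiles, which gives a feel for what "uniformly close" marginals would require.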

In the following two sections we develop the results, examining separately
the cases when $x_0 > -\infty$ and $x_0 = -\infty$.


3. The finite endpoint case. Throughout this section we assume that
$x_0 > -\infty$. Suppose that in some interval $(x_0, x_0 + \delta)$, each $F_n$ has a
continuous derivative $F_n'$ such that $\lim_{x \downarrow x_0} F_n'(x)$ exists and is positive.
Assume further that

(3.1)  $x_{n,i} - \bar{x}_n = o(k_n^{1/2}/n)$

uniformly in $i = 1, \ldots, n$, as $n \to \infty$ (which implies that $x_{n,i} \to x_0$), and that

(3.2)  $\lim_n n^{-1} \sum_{i=1}^{n} F_i'(\bar{x}_n) = \lim_n F'_{(n)}(\bar{x}_n) > 0$.

(The limit is assumed to exist.) Finally, assume that

(3.3)  $\lim_n k_n/\sigma_n^2 < \infty$,

where

(3.4)  $\sigma_n^2 = \mathrm{Var}\big(\sum_{i=1}^{n} I(X_i \le x_{n,i})\big)$
             $= k_n(1 - k_n/n) + 2 \sum_{1 \le i < j \le n} \big(P(X_i \le x_{n,i}, X_j \le x_{n,j}) - (k_n/n)^2\big)$.

Then we have the following result.


Theorem 1. Under the above conditions (3.1) - (3.3),

(3.5)  $X_{k_n}^{(n)} = \bar{x}_n + \frac{\sum_{i=1}^{n} (k_n/n - I(X_i \le x_{n,i}))}{n F'_{(n)}(\bar{x}_n)} + R_n$,

where $n R_n / k_n^{1/2} \xrightarrow{P} 0$.
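
As a rough numerical illustration of the remainder in (3.5), the sketch below treats the simplest special case: independent Uniform(0, 1) variables, for which $F(x) = x$, $F' = 1$, and every $x_{n,i}$ equals $x_n = k_n/n$. The sample size, rank, number of replications, and function names are hypothetical choices made for the example; a simulation of this kind can only suggest, not establish, that $n R_n / k_n^{1/2}$ is small.

```python
import random

def remainder(sample, k, x_bar, density):
    """R_n in (3.5) when every x_{n,i} equals x_bar (identically distributed case)."""
    n = len(sample)
    order_stat = sorted(sample)[k - 1]              # X_{k_n}^{(n)}, the k-th smallest value
    count = sum(1 for x in sample if x <= x_bar)    # sum of the indicators I(X_i <= x_bar)
    linear_term = (k - count) / (n * density)       # sum of (k/n - I(.)) over n F'(x_bar)
    return order_stat - x_bar - linear_term

def scaled_remainders(n, k, reps, seed=0):
    """n R_n / k^(1/2) over independent Uniform(0, 1) samples (F(x) = x, F' = 1)."""
    rng = random.Random(seed)
    out = []
    for _ in range(reps):
        sample = [rng.random() for _ in range(n)]
        out.append(n * remainder(sample, k, k / n, 1.0) / k ** 0.5)
    return out

values = scaled_remainders(n=20000, k=200, reps=20)
mean_abs = sum(abs(v) for v in values) / len(values)
print(mean_abs)
```

Rerunning with larger `n` and `k` (keeping `k / n` small) is one way to watch the scaled remainder shrink, in line with the statement of the theorem.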


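Proof: We first assume that the $\{X_n\}$ are independent and later indicate the
modifications to be made for the more general m-dependent case.

For any real $t$ we have for sufficiently large $n$,

(3.6)  $P\big(n(X_{k_n}^{(n)} - \bar{x}_n)/k_n^{1/2} \le t\big) = P\big(\sum_{i=1}^{n} I(X_i \le \bar{x}_n + t k_n^{1/2}/n) \ge k_n\big) = P(Z_{t,n} \le t_n)$,

say, where

    $Z_{t,n} = \frac{1}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \sum_{i=1}^{n} \big(F_i(\bar{x}_n + t k_n^{1/2}/n) - I(X_i \le \bar{x}_n + t k_n^{1/2}/n)\big)$,

    $t_n = \frac{1}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \big(n F_{(n)}(\bar{x}_n + t k_n^{1/2}/n) - k_n\big)$.

Since $F_{(n)}(\bar{x}_n + t k_n^{1/2}/n) = k_n/n + t k_n^{1/2} F'_{(n)}(\bar{x}_n)/n + o(k_n^{1/2}/n)$, it follows
that as $n \to \infty$,

(3.7)  $t_n = \frac{n}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \big(t k_n^{1/2} F'_{(n)}(\bar{x}_n)/n + o(k_n^{1/2}/n)\big) \to t$.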


Suppose $t > 0$. Let

    $W_n = \frac{1}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \sum_{i=1}^{n} \big(k_n/n - I(X_i \le x_{n,i})\big)$.

By (3.1) we have for large $n$ that $x_{n,i} < \bar{x}_n + t k_n^{1/2}/n$, so that

(3.8)  $W_n - Z_{t,n} = \frac{1}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \sum_{i=1}^{n} \big(I(x_{n,i} < X_i \le \bar{x}_n + t k_n^{1/2}/n) - p_{n,i}\big)$,

where $p_{n,i} = F_i(\bar{x}_n + t k_n^{1/2}/n) - k_n/n$. Since

    $E(W_n - Z_{t,n})^2 \le \frac{1}{k_n (F'_{(n)}(\bar{x}_n))^2} \sum_{i=1}^{n} p_{n,i}$

and since

    $\sum_{i=1}^{n} p_{n,i} = n F_{(n)}(\bar{x}_n + t k_n^{1/2}/n) - k_n = t k_n^{1/2} F'_{(n)}(\bar{x}_n) + o(k_n^{1/2})$,

it follows that $E(W_n - Z_{t,n})^2 \to 0$ as $n \to \infty$, so that

(3.9)  $W_n - Z_{t,n} \xrightarrow{P} 0$.

Similarly, (3.9) holds for $t \le 0$. Also, by the usual central limit theorem
for row sums from triangular arrays, it is easily seen that

(3.10)  $W_n \Rightarrow N\big(0, (\lim_n F'_{(n)}(\bar{x}_n))^{-2}\big)$,

so in particular the sequence of distributions $\{\mathcal{L}(W_n)\}$ is tight. ($\Rightarrow$ denotes
convergence in distribution.)

Let $V_n = n(X_{k_n}^{(n)} - \bar{x}_n)/k_n^{1/2}$. Then by (3.6) - (3.10) we see that $V_n$ and $W_n$
satisfy the conditions of Lemma 1 of Ghosh [3], that is, the collection
$\{\mathcal{L}(W_n)\}$ is tight, and for every real $t$ and every $\epsilon > 0$,

    $\lim_n P(V_n \le t, W_n \ge t + \epsilon) = 0$

and

    $\lim_n P(V_n \ge t + \epsilon, W_n \le t) = 0$.

Therefore $V_n - W_n \xrightarrow{P} 0$. Hence for $R_n$ defined by (3.5),

    $n R_n / k_n^{1/2} = n(X_{k_n}^{(n)} - \bar{x}_n)/k_n^{1/2} - \frac{1}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \sum_{i=1}^{n} \big(k_n/n - I(X_i \le x_{n,i})\big)$
                      $= V_n - W_n$
                      $\xrightarrow{P} 0$.


We now indicate the modifications to be made in order to establish the
theorem for m-dependent sequences. We need to show that the sequence $\{\mathcal{L}(W_n)\}$
is tight, and

(3.11)  $E(W_n - Z_{t,n})^2 \to 0$.

To obtain (3.11) we use the following result of Rosen [4]: If $Y_1, \ldots, Y_n$
are m-dependent ($n \ge m$) and have finite variances, then

(3.12)  $\mathrm{Var}\big(\sum_{i=1}^{n} Y_i\big) \le (2m + 1) \sum_{i=1}^{n} \mathrm{Var}\, Y_i$.

Applying this to (3.8) yields

    $E(W_n - Z_{t,n})^2 \le \frac{2m + 1}{k_n (F'_{(n)}(\bar{x}_n))^2} \sum_{i=1}^{n} p_{n,i}$,

which tends to zero as in the independent case.
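
The variance inequality quoted above as (3.12) can be checked exactly on a simple m-dependent sequence. The sketch below uses moving sums $Y_i = e_i + \cdots + e_{i+m}$ of independent unit-variance variables, a hypothetical example chosen because all variances are available in closed form; the function name and the values of `n` and `m` are likewise illustrative. It verifies a single instance and is not a proof of the inequality.

```python
def moving_sum_variances(n, m):
    """Exact variances for Y_i = e_i + ... + e_{i+m}, i = 0..n-1, with i.i.d. unit-variance e_j."""
    # c[j] counts how many of the Y_i contain e_j, so sum_i Y_i = sum_j c[j] * e_j.
    c = [0] * (n + m)
    for i in range(n):
        for j in range(i, i + m + 1):
            c[j] += 1
    var_of_sum = sum(cj * cj for cj in c)   # Var(sum_i Y_i), by independence of the e_j
    sum_of_vars = n * (m + 1)               # sum_i Var(Y_i), since each Var(Y_i) = m + 1
    return var_of_sum, sum_of_vars

n, m = 50, 2
var_of_sum, sum_of_vars = moving_sum_variances(n, m)
bound = (2 * m + 1) * sum_of_vars           # right-hand side of (3.12)
print(var_of_sum, bound)                    # prints 442 750
```

Here $Y_i$ and $Y_j$ share a summand only when $|i - j| \le m$, so the sequence is m-dependent, and the variance of the sum (442) is indeed below the bound (750).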

In order to show that $\{\mathcal{L}(W_n)\}$ is tight we employ the following central
limit theorem for row sums from triangular arrays with m-dependent rows
($m \ge 0$ fixed), due to Rosen [4]: Let $\{X_{n,i}\}$, $1 \le i \le n$, with d.f.'s
$F_{n,i}(x) = P(X_{n,i} \le x)$ be an array of random variables with $X_{n,1}, \ldots, X_{n,n}$
m-dependent, and let $S_n = \sum_{i=1}^{n} X_{n,i}$. If

    $E X_{n,i} = 0$ for all $n$, $i$,

    $\mathrm{Var}\, S_n = 1$ for all $n$,

    $\lim_n \sum_{i=1}^{n} \int_{|x| > \epsilon} x^2 \, dF_{n,i}(x) = 0$ for all $\epsilon > 0$,

and

    $\lim_n \sum_{i=1}^{n} \mathrm{Var}\, X_{n,i} < \infty$,

then

    $S_n \Rightarrow N(0, 1)$.

Now, we first write

(3.13)  $W_n = -\frac{\sigma_n}{k_n^{1/2} F'_{(n)}(\bar{x}_n)} \sum_{i=1}^{n} X_{n,i}$.

If (3.3) holds, then it is easily shown that the array $\{X_{n,i}\}$, for $n$ large and
$1 \le i \le n$, given by

    $X_{n,i} = \big(I(X_i \le x_{n,i}) - k_n/n\big)/\sigma_n$

satisfies the assumptions of the above central limit theorem, so that
$S_n = \sum_{i=1}^{n} X_{n,i} \Rightarrow N(0, 1)$. Then using (3.12) again we see that
$\sigma_n^2/k_n \le (2m + 1)(1 + o(1))$, and it is clear from (3.13) that the sequence
$\{\mathcal{L}(W_n)\}$ is tight (but not necessarily weakly convergent). The proof of the
theorem is now complete. □

For identically distributed sequences $\{X_n\}$ we may replace the $x_{n,i}$,
$1 \le i \le n$, by $\bar{x}_n = x_n$, say, and thus disregard (3.1). Additionally, in this
instance the assumption (3.2) is redundant. If also the $\{X_n\}$ are independent,
then (3.3) is trivially satisfied and hence we have a representation weaker
than that given in [5], but using fewer assumptions on the marginal d.f. and
having no restriction upon the intermediate rank sequence $\{k_n\}$. Also, for
stationary m-dependent sequences ($m \ge 1$), (3.3) becomes

    $\lim_n k_n \Big(k_n + 2 \sum_{i=1}^{m} (n - i)\big(P(X_1 \le x_n, X_{i+1} \le x_n) - (k_n/n)^2\big)\Big)^{-1} < \infty$,

which holds if $(2/k_n) \sum_{i=1}^{m} (n - i)\big(P(X_1 \le x_n, X_{i+1} \le x_n) - (k_n/n)^2\big)$ is bounded
away from $-1$ for all large $n$.
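
To make the stationary condition concrete, the sketch below evaluates $k_n/\sigma_n^2$ from (3.4) for one hypothetical 1-dependent sequence, $X_i = \min(U_i, U_{i+1})$ with independent Uniform(0, 1) variables $U_i$. This example is an assumption made here for illustration: its marginal d.f. is $1 - (1 - x)^2$, with left endpoint 0 and derivative tending to 2 there, and the joint probability at lag one follows by inclusion-exclusion. The rank sequence $k_n \approx n^{1/2}$ and the function name are also hypothetical.

```python
import math

def variance_ratio(n, k):
    """k_n / sigma_n^2 from (3.4) for the 1-dependent sequence X_i = min(U_i, U_{i+1})."""
    p = k / n
    q = math.sqrt(1.0 - p)                 # q = 1 - x_n, since P(X_1 <= x) = 1 - (1 - x)**2 = p
    joint = 1.0 - 2.0 * q ** 2 + q ** 3    # P(X_1 <= x_n, X_2 <= x_n) by inclusion-exclusion
    lag_one = (n - 1) * (joint - p ** 2)   # the n - 1 adjacent pairs; longer lags are independent
    sigma_sq = k * (1.0 - p) + 2.0 * lag_one
    return k / sigma_sq

for n in (10**3, 10**5, 10**7):
    k = round(n ** 0.5)                    # hypothetical intermediate rank sequence
    print(n, k, variance_ratio(n, k))
```

In this example the covariance sum is positive and of the same order as $k_n$, so the ratio stays near 1/2 and (3.3) holds.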

To conclude this section we point out that the asymptotic normality of
$X_{k_n}^{(n)}$ under our assumptions easily follows as a corollary to the representation
(3.5). This may be seen by applying the above quoted central limit theorem
along with the relation (3.3), and we state the result formally as a theorem.

Theorem 2. With the previous notation and under the same conditions as Theorem 1,

    $\frac{n F'_{(n)}(\bar{x}_n)}{\sigma_n} \big(X_{k_n}^{(n)} - \bar{x}_n\big) \Rightarrow N(0, 1)$.
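
A quick sanity check of the asymptotic normality is possible in the independent Uniform(0, 1) case, where the k-th smallest of n observations has a Beta(k, n - k + 1) distribution. The sketch below assumes the normalization $n F'_{(n)}(\bar{x}_n)/\sigma_n$ suggested by (3.5) and (3.3), with $F' = 1$, $\bar{x}_n = k_n/n$, and $\sigma_n^2 = k_n(1 - k_n/n)$; the sample sizes, ranks, and function names are hypothetical. Agreement of the first two moments and of one coverage probability is only indicative.

```python
import random

def standardized_moments(n, k):
    """Exact mean and variance of T_n = n (X_(k) - k/n) / sigma_n for Uniform(0, 1) samples."""
    sigma = (k * (1.0 - k / n)) ** 0.5          # sigma_n from (3.4) in the independent case
    mean_x = k / (n + 1)                        # mean of Beta(k, n - k + 1)
    var_x = k * (n - k + 1) / ((n + 1) ** 2 * (n + 2))
    return n * (mean_x - k / n) / sigma, n ** 2 * var_x / sigma ** 2

def coverage(n, k, reps, seed=0):
    """Fraction of simulated T_n falling in [-1.96, 1.96]."""
    rng = random.Random(seed)
    sigma = (k * (1.0 - k / n)) ** 0.5
    hits = 0
    for _ in range(reps):
        x = rng.betavariate(k, n - k + 1)       # law of the k-th smallest of n uniforms
        hits += abs(n * (x - k / n) / sigma) <= 1.96
    return hits / reps

print(standardized_moments(10**6, 1000))
print(coverage(20000, 200, 2000))
```

The exact mean and variance are close to 0 and 1, and the simulated coverage should be near the nominal standard normal value of 0.95.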

4. The infinite endpoint case. In this section we suppose that the d.f.'s
$\{F_n\}$ of the m-dependent sequence $\{X_n\}$ have the common left endpoint $x_0 = -\infty$.
The assumptions made upon the marginal d.f.'s are generalizations of those
conditions in the independent and identically distributed case under which it
is known that $X_{k_n}^{(n)}$ is asymptotically normal (see Cheng [2]). Specifically,
assume first that there is a real number $c$ such that $F_n'(x)$ and $F_n''(x)$ exist
for $x < c$, for each $n \ge 1$. Then, with the previous notation, assume that there
are constants $p \le 1$ and $M < \infty$ such that

(4.1)  $\frac{k_n}{n |\bar{x}_n|^p F'_{(n)}(\bar{x}_n)} < M$

and

(4.2)  $\frac{k_n |F''_{(n)}(\bar{x}_n + y_n)|}{n (F'_{(n)}(\bar{x}_n))^2} < M$

for all large $n$, where $\{y_n\}$ is any sequence such that $y_n = o(|\bar{x}_n|^p)$.
Furthermore, suppose that

(4.3)  $x_{n,i} - \bar{x}_n = o\big(k_n^{1/2}/(n F'_{(n)}(\bar{x}_n))\big)$ uniformly in $i = 1, \ldots, n$,

and finally that (3.3) holds, where $\sigma_n^2$ is defined by (3.4).


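Theorem 3. Let $\{X_n\}$ be m-dependent with d.f.'s $\{F_n\}$ having the left endpoint
$x_0 = -\infty$ and such that the conditions (4.1) - (4.3) and (3.3) are satisfied.
Then

    $X_{k_n}^{(n)} = \bar{x}_n + \frac{\sum_{i=1}^{n} (k_n/n - I(X_i \le x_{n,i}))}{n F'_{(n)}(\bar{x}_n)} + R_n$,

where $n F'_{(n)}(\bar{x}_n) R_n / k_n^{1/2} \xrightarrow{P} 0$.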


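Proof: We proceed as in the proof of Theorem 1, supposing initially that the
$\{X_n\}$ are independent. For any real $t$,

    $P\big(n F'_{(n)}(\bar{x}_n)(X_{k_n}^{(n)} - \bar{x}_n)/k_n^{1/2} \le t\big) = P(Z_{t,n} \le t_n)$,

where now

    $Z_{t,n} = k_n^{-1/2} \sum_{i=1}^{n} \Big(F_i\big(\bar{x}_n + t k_n^{1/2}/(n F'_{(n)}(\bar{x}_n))\big) - I\big(X_i \le \bar{x}_n + t k_n^{1/2}/(n F'_{(n)}(\bar{x}_n))\big)\Big)$

and

    $t_n = (n/k_n^{1/2}) \Big(F_{(n)}\big(\bar{x}_n + t k_n^{1/2}/(n F'_{(n)}(\bar{x}_n))\big) - k_n/n\Big)$.

Let $y_n = t k_n^{1/2}/(n F'_{(n)}(\bar{x}_n))$. Since $p \le 1$ and $k_n \to \infty$, (4.1) implies that
$y_n = o(\bar{x}_n)$. Thus for $n$ sufficiently large a Taylor expansion gives

    $F_{(n)}(\bar{x}_n + y_n) = k_n/n + t k_n^{1/2}/n + \tfrac{1}{2} y_n^2 F''_{(n)}(\bar{x}_n + \theta_n y_n)$,

where $|\theta_n| \le 1$. It then follows from (4.2) that $t_n \to t$.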
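
The convergence of $t_n$ to $t$ can be seen explicitly in a simple special case. The sketch below assumes identically distributed variables with d.f. $F(x) = e^x$ for $x \le 0$ (a hypothetical example with left endpoint $-\infty$), so that $x_n = \log(k_n/n)$, $F'(x_n) = k_n/n$, and $y_n = t/k_n^{1/2}$. The function name and the numerical values are illustrative only.

```python
import math

def t_n(t, n, k):
    """t_n = (n / k^(1/2)) (F(x_n + y_n) - k/n) for the d.f. F(x) = exp(x), x <= 0."""
    x_n = math.log(k / n)                      # F(x_n) = k/n
    density = k / n                            # F'(x_n) = exp(x_n) = k/n
    y_n = t * math.sqrt(k) / (n * density)     # y_n = t k^(1/2) / (n F'(x_n)) = t / k^(1/2)
    assert x_n + y_n <= 0.0                    # stay where F(x) = exp(x) applies
    # F(x_n + y_n) - k/n = (k/n) * (exp(y_n) - 1); expm1 avoids cancellation.
    return (n / math.sqrt(k)) * (k / n) * math.expm1(y_n)

for k in (10**2, 10**4, 10**6):
    print(k, t_n(1.5, 100 * k, k))
```

In this case $t_n = k_n^{1/2}(e^{t/k_n^{1/2}} - 1) = t + t^2/(2 k_n^{1/2}) + \cdots$, so the second-order Taylor term is of order $k_n^{-1/2}$, as the argument in the proof requires.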

Let $t > 0$. Define

    $W_n = k_n^{-1/2} \sum_{i=1}^{n} \big(k_n/n - I(X_i \le x_{n,i})\big)$.

By (4.3) we may suppose that $x_{n,i} < \bar{x}_n + y_n$, so that

    $W_n - Z_{t,n} = k_n^{-1/2} \sum_{i=1}^{n} \big(I(x_{n,i} < X_i \le \bar{x}_n + y_n) - p_{n,i}\big)$,

where

    $p_{n,i} = F_i(\bar{x}_n + y_n) - k_n/n$.

Therefore

(4.4)  $E(W_n - Z_{t,n})^2 \le k_n^{-1} \sum_{i=1}^{n} p_{n,i} = (n/k_n)\big(F_{(n)}(\bar{x}_n + y_n) - k_n/n\big) \to 0$.

(4.4) holds similarly for $t \le 0$, and so $W_n - Z_{t,n} \xrightarrow{P} 0$ for every real $t$. Also,
$W_n \Rightarrow N(0, 1)$. Then defining

    $V_n = \big(n F'_{(n)}(\bar{x}_n)/k_n^{1/2}\big)\big(X_{k_n}^{(n)} - \bar{x}_n\big)$,

we have by Lemma 1 of Ghosh [3] that $V_n - W_n \xrightarrow{P} 0$, and the theorem for
independent sequences follows. Modifications similar to those made in the finite
endpoint case establish the theorem for m-dependent sequences. □

Just as in Section 3, simplifications of notation and assumptions can be
made for identically distributed sequences, but we do not restate these here.
However, we do point out the following result, which is valid in the general
setting.

Theorem 4. Under the conditions of Theorem 3,

    $\frac{n F'_{(n)}(\bar{x}_n)}{\sigma_n} \big(X_{k_n}^{(n)} - \bar{x}_n\big) \Rightarrow N(0, 1)$.